Conditional [MASK] Discrete Diffusion Language Model

  • 2025-02-17 12:33:10
  • Hyukhun Koh, Minha Jhang, Dohyung Kim, Sangmook Lee, Kyomin Jung
  • 0

Abstract

Although auto-regressive models excel in natural language processing, theyoften struggle to generate diverse text and provide limited controllability.Non-auto-regressive methods could be an alternative but often producedegenerate outputs and exhibit shortcomings in conditional generation. Toaddress these challenges, we propose Diffusion-EAGS, a novel framework thatintegrates conditional masked language models into diffusion language modelsthrough the theoretical lens of a conditional Markov Random Field. In doing so,we propose entropy-adaptive Gibbs sampling and entropy-based noise schedulingto counterbalance each model's shortcomings. Experimental results show thatDiffusion-EAGS outperforms baselines and achieves the best quality-diversitytradeoff, demonstrating its effectiveness in non-autoregressive textgeneration.

 

Quick Read (beta)

loading the full paper ...